Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
نویسنده
چکیده مقاله:
Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a low cost transformation to project curved area to 2-D rectangular area. We evaluate the performance of the proposed methods in combination with three participating methods on the public DFKI-I dataset (CBDAR 2007 dewarping contest), which contains camera-captured document images. Experimental results indicate the effectiveness and superiority of the proposed method.
منابع مشابه
Document Image Dewarping Contest
Dewarping of documents captured with hand-held cameras in an uncontrolled environment has triggered a lot of interest in the scientific community over the last few years and many approaches have been proposed. However, there has been no comparative evaluation of different dewarping techniques so far. In an attempt to fill this gap, we have organized a page dewarping contest along with CBDAR 200...
متن کاملA Model-based Book Dewarping Method Using Text Line Detection
In this paper, we propose a book dewarping model based on the assumption that the book surface is warped as a cylinder. This model extends the model proposed by Cao and makes Cao’s model a special case of our model. This extension removes the constraint of Cao’s model that the camera lens must be strictly parallel to the book surface, which is hard to make in practice, therefore enables a user ...
متن کاملRestoring Warped Document Image Based on Text Line Correction
Abstract Document images captured by camera often suffer from warping and distortions because of the bounded volumes and complex environment light source. These effects not only reduce the document readability but also the OCR recognition performance. In this paper, we propose a method to combine non-linear and linear compensation for correcting distortions of document images. First, due to the...
متن کاملImage dewarping and text extraction from mobile captured distinct documents
Camera Based Document Analysis (CBDA) is an emerging field in computer vision and pattern recognition. In recent days, cameras are moulded with several items of additional equipment. Thus, they play a vital role in the replacement of scanners with hand-held imaging devices (HIDs) like digital cameras, mobile phones and gaming devices. Warping is a common appearance in camera captured document i...
متن کاملPerformance evaluation methodology for document image dewarping techniques
The performance evaluation of dewarping techniques is currently addressed by concentrating in visual pleasing impressions or by using optical character recognition (OCR) as a means for indirect evaluation. In this study, the authors present a performance evaluation methodology that calculates a comprehensive evaluation measure which reflects the entire performance of a dewarping technique in a ...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ذخیره در منابع من قبلا به منابع من ذحیره شده{@ msg_add @}
عنوان ژورنال
دوره 27 شماره 12
صفحات 1855- 1862
تاریخ انتشار 2014-12-01
با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023